On Synchronous, Asynchronous, and Randomized Best-Response schemes for computing equilibria in Stochastic Nash games
نویسندگان
چکیده
In this work, we consider a stochastic Nash game in which each player solves a parameterized stochastic optimization problem. In deterministic regimes, best-response schemes have been shown to be convergent under a suitable spectral property associated with the proximal best-response map. However, a direct application of this scheme to stochastic settings requires obtaining exact solutions to stochastic optimization problems at each iteration. Instead, we propose an inexact generalization of this scheme in which an inexact solution to the best-response problem is computed via an increasing number of projected stochastic gradient steps. On the basis of this framework, we present three inexact best-response schemes: (i) First, we propose a synchronous inexact best-response scheme where all players simultaneously update their strategies; (ii) Subsequently, we extend this to a randomized setting where a subset of players is randomly chosen to their update strategies while the other players keep their strategies invariant; (iii) Finally, we propose an asynchronous scheme, where each player determines its own update frequency and may use outdated rival-specific data in updating its strategy. Under a suitable contractive property of the proximal best-response map, we proceed to derive a.s. convergence of the iterates for (i) and (ii) and mean-convergence for (i) – (iii). In addition, we show that for (i) – (iii), the produced iterates converge to the unique equilibrium in mean at a prescribed linear rate with a prescribed constant rather than a sub-linear rate. Finally, we establish the overall iteration complexity of the scheme in terms of projected stochastic gradient steps for computing an −Nash equilibrium and note that in all settings, the iteration complexity is O(1/ ) where c = 0 in the context of (i) and represents the positive cost of randomization (in (ii)) and asynchronicity and delay (in (iii)). The schemes are further extended to settings where players solve two-stage recourse-based stochastic Nash games with linear and quadratic recourse. Finally, we apply the developed methods to a multi-portfolio investment problem, and carry out numerical simulations to demonstrate the rate and complexity statements.
منابع مشابه
Asynchronous Schemes for Stochastic and Misspecified Potential Games and Nonconvex Optimization
The distributed computation of equilibria and optima has seen growing interest in a broad collection of networked problems. We consider the computation of equilibria of convex stochastic Nash games characterized by a possibly nonconvex potential function. In fact, any stationary point of the potential function is a Nash equilibrium of the associated game. Consequently, there is an equivalence b...
متن کاملImpact of Asynchrony on the Behavior of Rational Selfish Agents
The behavior of rational selfish agents has been classically studied in the framework of strategic games in which each player has a set of possible actions, players choose actions simultaneously and the payoff for each player is determined by the matrix of the game. However, in many applications, players choose actions asynchronously, and simultaneity of this process is not guaranteed: it is po...
متن کاملComputing Nash Equilibria for Stochastic Games
We present two new algorithms for computing Nash equilibria of stochastic games. One is a global random start algorithm based on nonlinear programming, while the other combines a dynamical system with nonlinear programming to find stable equilibria. Promising numerical results are presented.
متن کاملThe Stochastic Response Dynamic: A New Approach to Learning and Computing Equilibrium in Continuous Games
This paper describes a novel approach to both learning and computing Nash equilibrium in continuous games that is especially useful for analyzing structural game theoretic models of imperfect competition and oligopoly. We depart from the methods and assumptions of the traditional “Calculus” approach to computing equilibrium. Instead, we use a new and natural interpretation of games as condition...
متن کاملA Study of Gradient Descent Schemes for General-Sum Stochastic Games
Zero-sum stochastic games are easy to solve as they can be cast as simple Markov decision processes. This is however not the case with general-sum stochastic games. A fairly general optimization problem formulation is available for general-sum stochastic games by Filar and Vrieze [2004]. However, the optimization problem there has a non-linear objective and non-linear constraints with special s...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017